PolicyMiner: From Oysters to Pearls

نویسندگان

  • Hossein Rahmani
  • Christine Arnold
چکیده

Today, a historically unprecedented volume of data is available in the public domain with the potential of becoming useful for researchers. More than at any other time before, political parties and governments are making data available such as speeches, legislative bills and acts. However, as the size of available data increases, the need for sophisticated tools for web-harvesting and data analysis simultaneously grows. Yet, for the most part researchers who are developing these tools come from a computer science background, while researchers in the social and behavior sciences who have an interest in using such tools often lack the necessary training to apply these tools themselves. In order to provide a bridge between these two communities we propose a new tool called PolicyMiner. The objective of this tool is twofold: First, to provide a general purpose web-harvesting and data clean-up tool which can be used with relative ease by researchers with limited technical backgrounds. The second objective is to implement knowledge discovery algorithms that can be applied to textual data, such as legislative acts. With our paper we present a technical document which details the steps of data processing that have been implemented in the PolicyMiner. First, the PolicyMiner harvests the raw html data from publically available websites, such as governmental sites, and provides a unique integrated view for the data. Second, it cleans the data by removing irrelevant items, such as html tags and non-informative terms. Third, it classifies the harvested data according to a pre-defined standard conceptual hierarchy relying on the Eurovoc thesaurus. Fourth, it applies different knowledge discovery algorithms such as time series and correlation-based analysis to capture the temporal and substantive policy dependencies of the textual data across countries.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fluorescence from Pearls of Freshwater Bivalves and Its Contribution to the Distinction of Mother Oysters Used in Pearl Culture

By measuring the fluorescence spectra of pearls from various species of mother oysters, a distinction has already become possible as to whether the pearls are from Pinctada fucata, Pinctada maxima, Pteria penguin or Pinctada margaritifera. In this study, the fluorescence of pearls from freshwater bivalves was measured. The results have made it possible to distinguish freshwater pearls from the ...

متن کامل

New developments in cultured pearl production: use of organic and baroque shell nuclei

Cultured pearls can be produced both with and without a nucleus. Marine pearl oysters that produce Akoya, South Sea and Tahitian cultured pearls typically use nuclei for their pearl products. The nucleus material used for these beaded cultured pearls is traditionally from freshwater Mississippi mussels. In recent years, there have been a number of attempts to use alternative pearl and shell mat...

متن کامل

Assessing Pearl Quality Using Reflectance UV-Vis Spectroscopy: Does the Same Donor Produce Consistent Pearl Quality?

Two groups of commercial quality ("acceptable") pearls produced using two donors, and a group of "acceptable" pearls from other donors were analyzed using reflectance UV-Vis spectrophotometry. Three pearls with different colors produced by the same donor showed different absorption spectra. Cream and gold colored pearls showed a wide absorption from 320 to about 460 nm, while there was just sli...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013